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DETAILED ACTION 
Response to Amendment 

1 . In response to the office action from 9/14/2006, the applicant has submitted an 
amendment, filed 12/14/2006, amending claims 1, 11-13, 15, 21, and 29, while arguing to 
traverse the art rejection based on the newly presented limitation regarding receiving a user 
profile, related to vocal information and at least one of a communication device, transducer, and 
a probability value ( associated with a probability of a user being in a particular noise 
environment, from a memory and compensating a speech recognition model based on the user 
profile (Amendment, Pages 11-12). The applicant's arguments have been fully considered but 
are moot with respect to the new grounds of rejection, necessitated by the amended claims, and 
further in view of Besling et al (U.S. Patent: 6,363,348). 

Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 
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3. Claims 1-4, 8-9, 13, 15, 18-19, 21-23, and 27-29 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Komoiiet al (U.S. Patent: 7,050,974) in view of Besling et al (U.S. 
Patent: 6,363,348). 

With respect to Claims 1, 13, and 21, Komori recites: 

A memory that stores user voice data and data related to at least one of a communication 
device, transducer, vocal information, and acoustic environmental data (model holding unit, Col. 
2, Lines 25-34; Col. 3, Lines 24-30; and adaptation data types, Col. 5, Lines 3-35); - 

A controller coupled with the memory that determines the data of the at least one 
communications device, transducer, vocal information, and acoustic environmental data and then 
compensates at least one speech recognition model to reflect the user data (speech recognition 
model adaptation unit, Col. 3, Lines 24-30; and Col 5, Lines 3-35); 

A communication device that receives speech utterances over a network (network 
communication means, Col. 2, Lines 10-14; Fig. 1, Element 300; and Abstract); and 

A speech recognizer that recognizes the speech utterances by using the at least one 
compensated speech recognition model (speech recognition unit, Fig. 1, Element 203; and 
speech recognition performed using adapted models, Col 4, Lines 61-66). 

Although Komori discloses a speech recognition adaptation system and method that 
adapts a speech recognition model based on user parameters including speaker and environment 
parameters, Komori does not recite a controller that receives a stored user profile and updates a 
stored recognition model based on the user profile. Besling, however, recites a retrieving means 
that retrieves a basic recognition model from memory, retrieves an adaptation profile identified 
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by a user ID (user profile), and, adapts a recognition model under control of the adaptation 
profile (Col. 7, Line 35- Col. 8, Line 55). 

Komori and Besling are analogous art because they are from a similar field of endeavor 
in speech recognition model adaptation. Thus, it would have been obvious to a person of 
ordinary skill in the art, at the time of invention, to modify the teachings of Komori with the 
adaptation model storage and retrieving means taught by Besling in order to implement more 
efficient speech recognition model storage that only requires storing one basic model of a given 
type and smaller adaptation profiles (Besling, Col. 4, Line 67- Col. 5, Line 2), 

With respect to Claims 2 and 22, Komori further recites: 

The transducer data includes a distortion value related to a transducer of a mobile 
communications device (portable telephone, Col. 1, Lines 17-24; Col 2, Lines 15-24; and 
microphone distortion, Col. 5, Lines 13-24). 

With respect to Claims 3 and 23, Komori further recites: 

The acoustic environmental data includes a background noise value that corresponds to 
an operating environment of a mobile communications device (portable telephone, Col. l y Lines 
17-24; Col 2 } Lines 15-24; and noise distortion, Col. 3, Lines 24-30; Col 5, Lines 3-12). 

With respect to Claim 4, Komori further recites: 
The vocal information includes a distortion value related to an end user associated with a mobile 
communications device (portable telephone, Col 1, Lines 17-24; Col. 2 y Lines 15-24; and 
speaker adaptation, Col. 5 } Lines 25-34). 
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With respect to Claims 8 and 27, Komori teaches speech recognition performed at a 
network server (Fig. 1, Element 200), while Besling recites the use of HMM speech models (Col. 
2, Lines 60-67). 

With respect to Claims 9 and 28, Komori teaches speech recognition performed at a 
network server system (Fig. 1, Element 200) having environment, speaker pronunciation, and 
microphone adaptation elements, and feature a storing means in communication with a speech 
recognizer (Col. 3, Lines 24-30; Col. 5, Lines 3-35; and Fig. 1). 

With respect to Claims 15 and 29, Komori recites: 

The acoustic environmental data is determined using at least one microphone in an end 
user's environment (microphone for speech input, Fig. 1, Element 101; and Col. 5, Lines 13-24). 

With respect to Claim 18, Komori discloses: 
The vocal information represents a variability that exists in vocal tract shapes among speakers of 
a group (speaker-specific adaptation data that would differentiate a particular speaker s voice 
from other speech recognition network users, Col. 5, Lines 25-34). 

With respect to Claim 19, Komori discloses: 
The controller communicates with a memory that stores various acoustic environmental models 
and various features of a specific type of mobile device (portable telephone, Col. 1, Lines 1 7-24; 
Col. 2, Lines 15-24; and model holding unit, Col. 2, Lines 25-34; Col. 3, Lines 24-30; and 
adaptation data types, Col. 5, Lines 3-35). 
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4. Claims 5, 6, and 25 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Komori et al in view of Besting et al and further in view of Kanevsky et al (U.S. Patent: 
6,442,519). 

With respect to Claim 5, Komori in view of Besting teaches the speech recognition 
model adaptation system utilizing microphone, speaker, and environmental noise data, as applied 
to Claim 1. Komori in view of Besling does not teach that the aforementioned data is provided 
by a personal computer, however Kanevsky teaches a personal computer used to receive speech 
data (Col. 4, Lines 18-46). 

Komori, Besling, and Kanevsky are analogous art because they are from a similar field of 
endeavor in speech recognition model adaptation. Thus, it would have been obvious to a person 
of ordinary skill in the art, at*the time of invention, to modify the teachings of Komori in view of 
Besling with the use of a personal computer for receiving speech data as taught by Kanevsky in 
order to expand the amount of speech data available for improved speech recognition by utilizing 
a personal computer connected to a network to receive speaker data (Kanevsky, Col. 3, Lines 10- 
35). 

With respect to Claims 6 and 25, Kanevsky further teaches a PDA for receiving speech 
data (Col. 4, Lines 18-46). 

5. Claims 7 and 26 are rejected under 35 U.S.C. 103(a) as being unpatentable over Komori 
et al in view of Besling and further in view of Hunt et al (U.S. Patent: 6,094,476). 

With respect to Claims 7 and 26, Komori in view of Besling teaches the speech 
recognition model adaptation system utilizing microphone, speaker, and environmental noise 
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data, as applied to Claims 1 and 21. Komori in view of Besling does not teach that the 
aforementioned data is provided through a satellite communications system, however Hunt 
teaches such a satellite communications system (Col. 4, Lines 16-23). 

Komori, Besling, and Hunt are analogous art because they are from a similar field of 
endeavor in speech recognition model adaptation. Thus, it would have been obvious to a person 
of ordinary skill in the art, at the time of invention, to modify the teachings of Komori in view of 
Besling with the use of a satellite communications system as taught by Hunt in order to provide a 
practical variation of a cellular phone network that allows a user to access voice mail through 
recognized speech commands (Hunt, Col 4, Lines 16-33). 

6. Claim 10, 17, and 30-31 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Komori et al in view of Besling et al and further in view of Heck et al (U.S. Patent: 5,950,157). 

With respect to Claim 10, Komori in view of Besling teaches the speech recognition 
model adaptation server system utilizing microphone, speaker, and environmental noise data, as 
applied to Claim 8. Komori in view of Besling does not specifically teach a means of updating a 
speaker model to reflect a specific type of communications device, however Heck teaches such 
an updating means (Col 9, Line 30- Col 10, Line 48). 

Komori, Besling, and Heck are analogous art because they are from a similar field of 
endeavor in speech recognition model adaptation. Thus, it would have been obvious to a person 
of ordinary skill in the art, at the time of invention, to modify the teachings of Komori in view of 
Besling with the speaker recognition model adaptation means taught by Heck in order to 
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implement a robust speaker recognition system that can function in the presence of handset 
mismatches (Heck, Col. 2, Lines 5-9). 

With respect to Claims 17 and 30, Komori in view of Besling teaches the speech 
recognition model adaptation system utilizing microphone, speaker, and environmental noise 
data, as applied to Claim 13. Komori in view of Besling does not specifically suggest that the 
microphone (transducer) data is a distortion value based on a difference between an actual 
transducer and a response characteristic of a training transducer, however Heck teaches such a 
distortion value that relates to transducer data (Col. 10, Lines 9-48). 

Komori, Besling, and Heck are analogous art because they are from a similar field of 
endeavor in speech recognition model adaptation. Thus, it would have been obvious to a person 
of ordinary skill in the art, at the time of invention, to modify the teachings of Komori in view of 
Besling with the transducer distortion scores taught by Heck in order to implement a robust 
speaker recognition system that can function in the presence of handset mismatches (Heck, Col. 
2, Lines 5-9). 

With respect to Claim 31, Komori in view of Besling teaches the speech recognition 
model adaptation system utilizing microphone, speaker, and environmental noise data, as applied 
to Claim 21. Komori in view of Besling does not specifically teach a means of updating a 
speaker model to reflect a specific type of communications device, however Heck teaches such 
an updating means (Col 9, Line 30- Col. 10, Line 48). 

Komori, Besling, and Heck are analogous art because they are from a similar field of 
endeavor in speech recognition model adaptation. Thus, it would have been obvious to a person 
of ordinary skill in the art, at the time of invention, to modify the teachings of Komori in view of 
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Besling with the speaker recognition model adaptation means taught by Heck in order to 
implement a robust speaker recognition system that can function in the presence of handset 
mismatches (Heck, Col 2, Lines 5-9). 

7. Claims 11-12, 20, and 32 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Komori et al in view of Besling et al and further in view of in view of Cilurzo et al (U.S. Patent: 
6,434,526). 

With respect to Claim 11, Komori in view of Besling teaches the speech recognition 
model adaptation server system utilizing microphone, speaker, and environmental noise data, as 
applied to Claims 1 and 21. Komori in view of Besling does not teach personal user account 
administrative information, however Cilurzo teaches such account information (Col 5, Lines 27- 
64). 

Komori, Besling, and Cilurzo are analogous art because they are from a similar field of 
endeavor in speech recognition. Thus, it would have been obvious to a person of ordinary skill 
in the art, at the time of invention, to modify the teachings of Komori in view of Besling with the 
user account information taught by Cilurzo in order to provide multiple users with access to a 
network speech recognizer having a capacity that can be expanded dynamically (Cilurzo, Col 2, 
Lines 22-51). 

With respect to Claims 12 and 32, Komori in view of Besling teaches the speech 
recognition model adaptation system utilizing microphone, speaker, and environmental noise 
data, as applied to Claims 1 and 21. Komori in view of Besling does not teach the ability to 
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select a specific speech recognition network, however Cilurzo teaches such a selection ability 
(Col. 5, Lines 4-26), 

Komori, Besling, and Cilurzo are analogous art because they are from a similar field of 
endeavor in speech recognition. Thus, it would have been obvious to a person of ordinary skill 
in the art, at the time of invention, to modify the teachings of Komori in view of Besling with the 
ability to select a specific speech recognition network as taught by Cilurzo in order to provide 
multiple users with access to a speech recognizer having a capacity that can be expanded 
dynamically (Cilurzo, Col 2, Lines 22-51). 

With respect to Claim 20, Cilurzo teaches the user account information as applied to 
claim 11. 

8. Claim 14 is rejected under 35 U.S.C. 103(a) as being unpatentable over Komori in view 
of Besling and further in view of Ranzino (U.S. Patent: 6,281,811). 

With respect to Claim 14, Komori in view of Besling teaches the speech recognition 
model adaptation system utilizing microphone, speaker, and environmental noise data, as applied 
to Claim 13. Komori in view of Besling does not teach the identification of a user device 
according to an RF ID tag, however Ranzino teaches such a means for user device identification 
(RF ID, Col. 4, Lines 31-42). 

Komori, Besling, and Ranzino are analogous art because they are from a similar field of 
endeavor in speech recognition systems. Thus, it would have been obvious to a person of 
ordinary skill in the art, at the time of invention, to modify the teachings of Komori in view of 
Besling with the RF ID tag taught by Ranzino in order to provide a means for easily identifying a 
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particular user to communicate information related to that user's preferences (Ranzino, Col. 1, 
Lines 56-61; and Col. 4, Lines 31-42). 

9. Claim 16 is rejected under 35 U.S.C. 103(a) as being unpatentable over Komori et al in 
view of Besling et al and yet further in view of Byers (U.S. Patent: 6,219,645). 

With respect to Claim 16, Komori in view of Besling teaches the speech recognition 
model adaptation system utilizing microphone, speaker, and environmental noise data, as applied 
to Claim 13. Komori in view of Besling does not specifically suggest a plurality of microphones 
that are initiated as an end user walks in between the microphones, howiever Byers teaches such a 
plurality of microphones (Col. 3, Lines 11-35; Col. 4, Line 66- Col. 5, Line 12; and Col. 12, 
Lines 30-56). 

Komori, Besling, and Byers are analogous art because they are from a similar field of 
endeavor in speech recognition. Thus, it would have been obvious to a person of ordinary skill 
in the art, at the time of invention, to modify the teachings of Komori in view of Besling with the 
multiple microphone speech recognition system taught by Byers in order to allow a user to 
control multiple ASR devices while providing mobility through a room or environment (Byers, 
Col. 1, Line 65- Col 2, Line 7). 

10. Claim 24 is rejected under 35 U.S.C. 103(a) as being unpatentable over Komori et al in 
view of Besling et al and yet further in view of Sonmez et al (U.S. Patent: 5, 745,872). 

With respect to Claim 24, Komori in view of Besling teaches the speech recognition 
model adaptation system utilizing microphone, speaker, and environmental noise data, as applied 
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to Claim 13. Although Komori recites receiving adaptation data from a wireless telephone 
(wireless telephone, Col 1, Lines 17-24; Col 2, Lines 15-24; and multiple data types, Col 5, 
Lines 3-35), Komori does not specifically suggest that the wireless telephone is a cellular 
telephone, however Sonmez teaches speech recognition model adaptation using a cellular phone 
(Col 2, Lines 1-7). 

Komori, Besling, and Sonmez are analogous art because they are from a similar field of 
endeavor in speech recognition model adaptation. Thus, it would have been obvious to a person 
of ordinary skill in the art, at the time of invention, to modify the teachings of Komori in view of 
Besling with the use of speech recognition model adaptation for a cellular phone as taught by 
Sonmez in order to adapt speech data to changing cellular phone environments (Sonmez, Col 2, 
Lines 1-7 and 29-36). 

Conclusion 

1 1 . Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
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CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 

12. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: 

Gould et al (U.S. Patent: 6,839,669)- teaches a speech recognition method that utilizes 
speaker profiles. 

13. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to James S. Wozniak whose telephone number is (571) 272-7632. 
The examiner can normally be reached on M-Th, 7:30-5:00, F, 7:30-4, Off Alternate Fridays. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard can be reached at (571) 272-7603. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). . 



James S. Wozniak 
2/27/2007 




